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The transcription of protein encoding genes and 
their regulation in plant cells shows the signature of 
a typical eukaryotic system. The major outcome of 
research in this area is recognition of variability in 
components assembled to design constitutive or in- 
ducible promoters, including those constituting en- 
hancers and boundary elements. Such cw-acting 
elements respond to diverse intrinsic and extrinsic 
signals to take care of the needs of sessile plants- 
While much remains to be defined about the basic 
transcription machinery in plants, it is expected to 
follow the pattern as deciphered in animals. At the 
same time, a large number of genes for regulatory 
proteins have been cloned leading to information 
about DNA binding domains, activation domains, 
nuclear localization signals and oligomerizution sites. 
However, most of this information needs to be tested 
by experimentation. Avaaability of transgenic sys- 
tems in plants and hiotechnological needs have given 
impetus to research for designing tailor-made induc- 
iblc/rcpressible promoters and to manipulate plant 
u,. ji^^\*^ww^^rr i-w^9%c/*w^tw\i-{n*ii fa'^tors. While 

considerable progress has been made, much remains 
to be learned about the biochemical and molecular 
basis of regulation of pl ant gene expression. 

THE drama of life is enacted by 'genes' which collec- 
tively act as a vast repository of information and 'ele- 
ments' which regulate their activity. It is in the early 
1960s, that the basic principles of gene regulation in 
prokiu-yotes were established', leading some years later 
to a theory of regulation of eukaryotic genes that 
evolved in the late sixties^ While in prokaryotes» the 
units of gene regulation were defined as promoters, op- 
erators and positive control elements, the analysis of 
eukaryotic promoters has to wait until the advent of 
recombinant DNA technology in 1970s which then led 
10 the recognition of both positive and negative regula- 
lory elements (enhancers and silencers) and in addition 
boundary elements. All these elements interact with 
proteins (repressors or activators) and RNA poly- 
merase(s) to establish the dynamic state of regulation of 
gene expression. Notwithstanding the universal nature 
of components and basic principles, some fundamental 
differences have emerged in prokaryotic and eukaryotic 
gene regulation^ While in prokaryoies the ground state 
is nonrestrictive and regulation is mostly approachable 
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by repression, in eukaryotes the ground state is restric- 
tive due to packaging of DNA in chromatin and» there- 
fore, regulation has to be accessed mainly by activation. 
Despite the gross level differences, tlie mechanism in- 
volved in gene expression reflects the same paltera, of 
course, with increasing diversity and complexity. 

Plants, by and large, also seem to follow typical eu- 
karyotic norms for gene expression and regulaiioiv Be- 
ginning in the late seventies, plant genes have been 
characterized in great detail. The sequencing of the 
complete genome oT Arabidopsis is nearing completion 
and that of the rice genome is also progressing rapidly *. 
Based on information from these studies and alsti data 
on non-redundant ESTs, it is estimated that planLs may 
contain approximately 25,000 to 50,000 genes. U has 
also been estimated that about one-third of these genes 
express in all the organs, albeit at variable levels. An- 
other one-third may have their products present in a 
few, but not all, organs and the rest may belong to one 
or other unique organ as far as expression is concerned- 

iiliA veil iuuiiti.jr lo Awin^vwv* '■"^ sy*^— '-^-^w.. 

moters and regulatory elements as well as in genes char- 
acterised for the regulating factors. In this article, an 
effort has been made to give a bird's eye view of gene 
expression in plants. The information is restricted 10 the 
genes producing o.nly mRNAs. Finally, emerging global 
trends in plant gene expression and the utility of regu- 
lated gene expression for biotechnological purposes has 
been taken up briefly. For obvious reasons, it is not pos- 
sible to discuss all aspects and readers are advised to 
consult other references for details^^^, 

Basic mechanism of regulation of gene 
expression 

Core promoters of eukaryotic protein-coding genes are 
defined as possessing TATA box, an initiator element 
as well as a transcription initiation site^ RNA poly- 
merase II and general transcription factors, GTFs (TBP, 
TFIIB. TFIIE. TFTIF, TFIIH) assemble at the core pro- 
moter and carry out accurate transcription in vitro. 
Other than the core elements, promoters also contain 
enhancer/regulatory elements which interact with acti- 
vators in a sequence-specific manner. Since enhancersi 
can act at a distance in bodi an orientation and position 
independent manner and chromatin organization can 
sequester distant regions together, it is necessary to de- 
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fine the boundary of aciion of enhancers. It has been 
found that certain sequences like the scaffold attach- 
ment regions (SARs), boundary elements (BEsS) and 
locus control regions (LCRs) and their interactions with 
proteins nriay help delimit functional domains of chro- 
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Another class of proteins^ called coactivators or me- 
diators, helps establish communication between activa- 
tors and the RNA polymerase U machinery^ Certain 
coactivators like SWI/SNF act as chromatin remodel- 
ling factors. Others may have histonc acetyltransferase 
activity (e.g. SAGA). It has been shown in yeast that a 
sequence-specific transcription factor SWI5 recruits 
SWI/SNF, which in turn recruits SAGA. Both are re- 
quired to recruit the sequence-specific factor, SW]4- 
SWI6, which in turn may help assemble the pre- 
initiation complex to activate the HO gene (coding for 
an endonuclcasc). Several other coaciivator complexes 
in yeast (e.g. SRB/MED) and their homologues in 
mammalian cells have been shown to mediate transcrip- 
tion through more than one activator proteins. TAFs 
(TBP-associaied factors in TFIID) were the first to be 
identified as coactivators. But, they seem to perform 
such a role mostly in meiazoan systems and not in 
yeast. In mammalian systems, however* TAFs seem to 
act in concert with other coactivators. It is possible that 
such observations reflect redundancy or division of la- 
bour among coactivators. This is also supported by 
shared subunits among different coactivator com- 
plexes". 

Turning to the mechanism of action, the coactivators 
mediate between activators and transcription machinery. 
The mechanistic steps in the process may include re- 
modelling of chromatin, recruitment of additional acti- 
vators, and/or direct interaction with RNA polymerase 
ri or GTFs (Figure I). Several coactivators (SRB/MED, 
SWI/SNF) have been found to be components of RNA 
polymerase II holoenzyme, raising the possibility of 
transcriptional activation by recruitment*^, This is also* 
supported by evidence for the pre.ience of a giant com- 
plex 'transcriptosome* which may include RNA poly- 
merase II, some GTFs and coactivators' ^ One may, 
therefore, visualize an immobile 'transcriptosome' and 
DNA moving through it for transcription and beyond'"! 

Plant promoters and regulatory elements 

In plants, the modern era of work on genes and their 
regulation began after the discovery of restriction cn- 
donucleases, advent of recombinant DNA revolution 
and arrival of DNA sequencing technology. These en- 
abled the isolation and sequencing of first genes such as 
those coding for large and small subunii of Rubisco (by 
groups of L, Bogorad, R. B, Meagher and N.-H. Chua), of 
leghaemoglobin (by P. S. Verma and K. A. Marcker) 



Sifc 



Figure 1. TranscHptionnl activation involves lqit prumoiur and 
enhancer elements. The uore promoter inreracrs with TBP and rhc 
enhancer elements interact with activator proteins A. B find C Acii- 
vator proteins inlcracl with conctivator3 (SWI/SNF, SAGA, 
SRB/MED, TAFs), general iranscHption fticiofs (TFTID, GTF,s) iiiid 
RNA polymcra.sc 11 (after Wu and Hampsey**). 



or the seed storage proteins (by R, Goldberg and T. 
Hall). Because of the uniqueness of the process of pho- 
tosynthesis in plants and early lead with genes encoding 
components of thylakoids and stroma, genes for suclt 
proteins have continued to attract attention of leading 
groups (like R. G. Herrmann, J. C. Gray. M. Sui.dura, 
N.-H. Chua and T. Cashmore). Recently, emphasis has 
shifted to characterize components of regulatory system 
and transcription machinery. 

The understanding of regulation of plant gene expres- 
sion at ciivaciing elements started with analysis of pro- 
moters from native plant genes and genes from viruses 
and Agrobacterium which express in plant cells''^. Ii 
was found that, like their counterparts in animals, plnni 
gene promoters contain TATA box, initiator clement as 
well as well-defined transcription initiation site Lihoui 
20 to 30 bp downstream to TATA element. In the up- 
stream region, several gene promoters were found to 
contain positive or negative regulatory elements, some 
of which were characterized as enhancers or silencers' 
The gain-of-funciion assay mostly served as evidence 
for the activity of an clement. Thus, a minimal promotei 
(mostly -46 bp derivative of 35S CaMV promoter or iis 
-90 bp derivative) is fused to a heterologous promoter 
sequence at its 5' end and to a reporter gene (gus, gfp, 
luc, cat) at its 3' end. Such constructs are introduced in 
plant cells by Agrobacterium^ electroporation or biolis- 
lics and evaluated either for transient expression of llic 
reporter gene or for its expression in stable transgenic 
system- The pattern of expression conferred by het- 
erologous promoter sequences has been helpful in de- 
fining characteristics of developmental (spatial and 
temporal) as well as inducible regualtory promolers'"*''^ 
A few major findings from such experimenLs are dis- 
cussed below. 
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Constitutive^ promoters 

A few promoters, found to express in most of ihe cells^ 
of plants albeU_at-variableJevcls,_havc been termed 
constitutive promoters. These include promoters like 
355.from CaMV. the octo pine synthase, nopaline syn- 
thase and, V2' mannopine synthase gene promoters 
{tquv AgrobacTerium, as well as promoters from actin I 
and ubiquitin genes*^"^l Investigations on the 35S pro- 
moter have served as paradigm for the analysis of activ- 
ity of constitutive promoiers^'^^ The 555 promoter has 
been divided into two major domains, A and B. Domain 
A contains the minimal promoter |(+8 to ^6 bp) and Al 
domain (-46 to -90 bp) characterized by the presence of 
as-] sequence. The B domain (-90 to -343 bp) has been 
defined as an enhancer and further divided into five 
subdomains, BI-B5. A combination of different B sab- 
domains with minimal promoter or domain A revealed 
the modular organization of the enhancer element. 
Thus, different combinations gave distinct expression 
patterns of reporter gene activity in early and mature 
stages of plant development, unravelling synergistic 
interactions among various elements within the 35S en- 
hancer» which ultimately resuh in constitutive expres- 
sion under control of the full promoter. 

Due to variable and weak expression of different con- 
stitutive promoters in certain tissues in transgenic 
plants, one can combine various modules to obtain ap- 
propriate constitutive expression. Thus, use of double 
35S enhancer resulted in about 6-foid higher expression 
compared to single enhancer. Another 6-fold increase in 
expression became possible" by combining +65 to - 
301 bp of mannopine synthase gene promoter and -90 
to -941 bp of 35S promoter^. A combination of a trimer 
of upstream activating sequences of octopine synthase 
promoter and mannopine synthase promoter led to 156- 
fold improvement over 35S promoter^^ and expression 
was found to be more uniform. 

State-specific promoters 

The life cyle of a plant includes both gameiophytic and 
sporophytic phases of development, each characterized 
by a unique spatial and temporal programme. In . addi- 
tion, due to the sessile nature, plants have to respond 
rapidly to environmental signals. Sometimes, plants 
adopt signals like light to control their developmental 
programme. All this requires a complex array of regula- 
tory elements responding to signals like light, heat, 
cold, hypoxia, dehydration, hormones, wounding or to 
cues emanating from the pattern of plant development, 
and differentiation. Characterization of several state- 
specific genes has been followed by analysis of their 
promoters to determine the ctj-acting elements involved 
in regulated expression of genes^ Some of these are 
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being discussed below to illustrate the nature of ele- 
ments and interactions involved. 

Organ-specific promoters: Such promoters confer ex- 
pression .specific to an organ and show a well-defined 
temporal expression panem. Organ-specific promoters 
have been isolated from genes for seed storage proteins, 
lectins, trypsin inhibitors, etc. It has been found that in 
most of the cases the first 200 bp are able to confer 
specificity of expression. However, distal regions may 
also have redundant quality control motifs or quantita- 
tive elements. In dicots, some elements, like the three 
tandem repeats of the 22 bp element coniainins TGACG 
motif, can confer ^3n the 35S minimal promoter (-46 bp) 
seed-specific expression in a heterologous transgenic 
sysiem^^. Some elements present in seed-specific pro- 
moters like the G box (CACGTG), E box (CACCTG), 
legumin or RY motif work in combination with other 
elements to confer quantitatively high seed-specific 
expression. Other elements from the distal region have 
also been identified to work as quantitative elements, 
These include the AT-rich motifs acting as positive 
elements or the AG A A (A/C)A and CA-repeat motifs 
acting as negative elements. 

In monocots also, seed-specific promoters and regula- 
tory elements have been identified first in heterologoiis 
transgenic dicoi system and more recently in homolo- 
gous monocot systems. Promoters for slutelin genes 
from rice have- been fonnd rn contain the pro 1 ami n box 
[TG(t/a/c)AAA(g/t)], GCN4 motif (5'-TGAGTCA-3'), 
ACGT motif and AACA motif (5'-AACAAACT 
CAATC-S'). A 45 bp GluA-3 promoter fragment con- 
tains a GCN4 motif acting as a positive regulatory ele- 
ment while the AACA motif acts as negative regulatory 
element in tissues other than endosperm^®. Deletion of 
these motifs in GluBd promoter results in loss of sccd- 
specificity and a multimer of 21 bp containing GCN4 
can confer endosperm-specific expression in transgenic 
rice on ^6 bp 555 promotcr^^. Further, it has been 
found that several of these elements are present in mul- 
tiples and show functional redundancy. It may, how- 
ever, be noted that the other three elements are not able 
to direct expression even in multiple copies and a com- 
binatorial interplay may be responsible for optimal ex- 
pression^". Another class of promoters specific to seeds 
includes those controlling expression of.jeiis in maize. 
The most important motif identified is called the 
opaque-2 binding site (5'-TTGACGTGG-3') \yhich con- 
trols the expression of genes in developing endosperm. 
This site acts in conjunction with the prolamin box in 
establishing expression of 22 kDa zein protein^. 

Environment-responsive promoters: Extrinsic cue^ like 
light play an important role in the life of a plant. Light- 
regulated promoters have been found to be present in a 
large number of genes encoding proteins related to pho- 
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tbsynthesis, metabolic enzymes and developmental 
processes from seed germination to flowcring^'^'. In the 
life of a plant, light is a very important signal and it 
manifests through multiple forms of photoreceptors and 
a complex signal transduction cascade^^. Among the 
light-regulaled promoters, lighl-regulated elemciils of 
varying length have been found to act as positive regu- 
latory elements or negative regulatory elements and 
show a high degree of variability and redundancy. In 
RbcS genes, the first few hundred base pairs of the pro- 
moters have been found to be sufficient for light regula- 
tion"'^^ and, in some cases, such elements worked as 
'enhancer-like elements'*^. Further work has delineated 
elements responding to specific photoreceptors. Phyio- 
chrome-responsive elements range from 10 to 40 bp in 
case of Cab genes, A fragment of Cab gene promoter 
(-176 to -36 bp) from tobacco could confer VLFR» LFR 
and HIR of phytochrome on the minimal CaMV 35S 
promoter^^ Similar examples have been reported for 
blue -light-responsive elements. 

An analysis of light-regulated promoter sequences has 
predicted 30 distinct conserved DNA module array s^^. 
These include GT-boxes, I-boxes, G-boxes and AT-rich 
sequences^'. Despite the fact that each of these elements 
has been shown to influence the response of a promoter 
to light, no single element has been shown to confer 
light-responsiveness on a minimal promoter. While a 
letramer of GT-1 site could confer light regulation on 
CaMV 35S (-90 bp), promoter containing as-J element, 
it failed to do so with CaMV 35S (-46 bp) promoter^ '. 
Analysis of several other promoters shows randomness 
about the presence of identified elements^* ''. Some of 
these elements like G-box have been found to be pre- 
sent also in non-light-responsive promoters and others 
like the H-box from chalcone synthase gene may re- 
spond to light as well as an elicitor. Such results point 
towards the combinatorial nature of light-regulated 
promoters involving multiple elements and proteins 
binding to them. In an effort to decipher the nature of 
combinatorial action, activity of synthetic promoters 
containing paired or single LREs in wild-type and light- 
response mutants of Arabidopsis has been investi- 
gated*^ It has been observed that only paired LREs are 
able to respond to a wide spectrum of light, while single 
LREs primarily respond to a specific wavelength of 
light. 

Chemically inducible promoters 

Such promoters may be either native plant promoters 
which are induced by chemicals like nitrate, hormones 
or elicitors of wounding/pathogen attack responses or 
designer promoters involving inducible gene systems 
from heterologous organisms^* Inducible promoters 
are highly desirable for both basic and applied research 



and should be able to show substantial difference be- 
tween active and inactive states. In basic resciirch, 
analysis of gene activity without interference in plant 
processes is possible, in addition to direct correlation 
between gene activity and phenotype, without the inRu- 
ence of secondary effects and homeostasis. Such pro- 
moters can also be deployed to engineer target-specific 
traits like conditional male-sterility and resistance lo 
pests or pathogens. 

The plant promoter from the PR- la gene has been 
shown to be inducible by application of benzothiadia- 
zole or isonicotinic acid. Certain agrochemicals are 
used to increase tolerance of plants to herbicides. Such 
a plant safener-inducible promoter, In 2-2, has been 
tested in Arabidopsis^^, In addition, several other regu- 
latory systems of plants can be tailored to provide 
chemically inducible systems for plants'*^. 

Regulatory elements from heterologous systems may 
be unique and need not respond to chemicals commonly 
capable of eliciting a response in plants. Such promot- 
ers can be made to act in plants if rrflfij-acting factors 
interacting with promoters in a chemically inducible 
manner are also transferred to plant systems. 

In an effort to tailor a tetracycline-inducible promoter 
for plants, the bacterial repressor protein (TetR) gene 
was introduced into plants under the control of CaMV 
35s promoter. This binds to tet operator in the absence 
to tetracyclines. The target promoter was engineered to 
contain tet operator sites flanking TATA box in CaMV 
35S promoter. Application of tetracycline abolished 
interaction of tet operator-repressor, thereby resulting in 
500- to 800 -fold induction^. Alternatively, a tetracy- 
cline-inactivatable system made use of TetR and the 
acidic domain of Herpes simplex protein 16 (VP 16) fu- 
sion (tTA) expression under the control of CaMV 35S 
promoter. The tTA interacted vi^th the synthetic pro- 
moter containing several tet operators upstream of 
TATA box and resulted in high expression of the re- 
porter gene. Application of tetracycline abolished tTA- 
promotcr interaction and expression effcctjvply^'''''\ 
Similarly, steroid-induciblc promoters made use of the 
glucocorticoid receptor binding domain fused to plant 
transcription factors and its binding sites to yield a sys- 
tem which could be activated by application of the f;lu- 
cocorticoid dexamethasone^^. A frirther advance was 
made by fusing the glucocorticoid receptor ligand- 
binding domain to DNA-binding domain of yeast GAL4 
and the VP16 activation domain and targeting it to a 
promoter containing six GAL4 binding sites^^. In an- 
other development, a certified agrochemical RH5992 
inducible system makes use of ligand-binding domain 
of the ecdysone receptor from insects^*. A novel induc- 
ible system makes use of control elements from copper- 
inducible genes in yea.<!t and transcription factor ACEl. 
The chimeric promoter containing CaMV 35S minimal 
promoter could be induced SO-fold on application of 
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coppe^'^^ More recently, an alcohol-inducibie system 
has made use of a regulator element from alcA promoter 
of Aspergillus nidulans in combination with minimal 
CaMV 35S promoter and the AlcR transcription fac- 
tor^°. The system has been reported to work on applica- 
tion of 0.1% alcohol in hydroponic culture and by soil 
drenching with 1% aicohoL It is hoped that such efforts 
would lead to efficient inducible and inactivatablc sys- 
tems which could be further made target-specific in 
combination with organ-specific or temporal elements 
from plant systems. 

Plant RNA polymerases and basal transcription 
factors 

In plants, early efforts related to biochemical and im- 
munological characterization of RNA polymerases, 
which like their counterparts in animals, turned out to 
be mullimeric enzyme complexes*^"^^ Interestingly, in 
monocots snRNAs arc transcribed by both RNA poly- 
merase II and III, rather than the latter alone, and their 
genes contain TATA box at -30 bp and an upstream 
element. The distance of the upstream element to TATA 
box determines the specificity, i.e. 3 or 4 helical turns 
for RNA polymerase 11 or HI, respectively"*'- In 1990, 
genes for the largest subunit of RNA polymerase II 
were cloned from Arabidopsis and soybean^^' \ The 
polypeptide is encoded by a single copy gene in Arabi" 
dopsis and from a multigene family in soybean. The 
deduced amino acid sequence shows eight regions of 
similarity with its counterparts and the order of Zn ^- 
fmgcr, the CB-amanitin binding motif and CTD (with 41 
repeats) is conserved. Recently, a detailed analysis of the 
complex of DNA and a RNA polymerase I holoenzyme 
activity from Brassica has revealed that rRNA gene pro- 
moters can interact with the holoenzyme activity in a 
single DNA-binding cvent^^ Such an evidence suggests 
that RNA polymerase I holoenzyme exists in a form that 
is self-sufficient for its activity on rRNA gene promot- 
ers. It correlates with the view that RNA polymerase 
holoenzymes are recruited rather than assembled ^ 

To function effectively, plant RNA polymerases also 
require basal transcription factors. TFIID is considered 
important for the recruitment of RNA polymerase II. 
Interestingly, its crucial component, TBP, is important 
not only for RNA polymerase II but also for two other 
RNA polymerases. The gene for TBP from Arabidopsis 
was amongst the first to be cloned from a two-gene 
family", followed by that from potato and maize. The 
gene codes for a polypeptide of about 180 amino acid 
residues, showing conservation with the C-tenninal re- 
gions of polypeptides from other sources. Further, co- 
crystallization of Arabidopsis TBP and a TATA box 
sequence from the adenovirus major late promoter 
shows thit TBP binds to the minor groove of DNA, 



causing major distortions in the contacted region . The 
polypcpude takes the form of a saddle-shaped structure 
from the organization of the ten-stranded anti-par^llcl 
/3-sheeLs and four a-helices lying on its upper surface. 
On either side of the saddle are present anti-parallel 
pairs of ^-strands giving a stirrup-like structure , TBP 
interacts with several other basal transcription factors, 
coacrivators and activators to regulate gene expression. 
Evidence for the presence of such coaclivators and 
TBP^associated factors (TAFs) in wheat has been pro- 
vided recently^*. In view of the availability of cDNAs 
for TFIIB from plants", such an interaction for TBP- 
TFlIB was evaluated in maize cells and amino acids 
from the 'stirrups' {E-144; E-146) were found to be cs^ 
sential for TFIIB binding in vitro. Such interactions 
were dispensable for basal transcription and also in case 
of constitutive promoters (i55. Ubiquitln from mai/c), 
but they help significantly in regulated expression" . 

TFIIA from wheat has been characterized and found 
to be similar to the human factor in action with both 
plant and animal viral p^omole^s^^ Recently, Arabidop- 
sis has been shown to contain two genes for ihe large 
subunit and one for the small subunit of TFIIA^^ The 
large subunit interacts with the small subunit via it,*; 
terminal pari and TFIIA also interacts with TBP- DKA 
complex. The TFIIA is functionally capable of stiniulat- 
ing transcription in yeast and plant cells and the activity 
is localized in the evolutionarily unconserved central 
region of the polypeptide which could be delimited to a 
35-amino acid segment. 

One major limiting factor in the progress of work on 
RNA polymerases in plants is the lack of an appropriate 
and efficient in vitro transcription system, despite sev- 
eral efforts^^ Although a system for transcription of 
RNA polymerase I-dependent genes shows species 
specificity, its efficiency is very poor, -0.001 compared 
to 1.4-4 per template for HeLa cell extracts. For RNA 
polymerase II, an efficient and faithful system is highly 
desirable. 

Regulatory proteins 

Besides RNA polymerases and basal transcription fac- 
tors, regulated expression of genes also involves several 
regulatory proteins. An estimate based on the genome 
sequence ot Arabidopsis has revealed that 15% proteins 
in plants may be involved in transcription". Some regu- 
latory proteins have been identified that are capable «f 
remodelling chromatin architecture which in turn facili- 
tates subsequent steps of transcription. Histone acetyl a- 
tion is a common event in this direction and can lead to 
unfolding of the nucleosome and transcription also in 
plants. Similarly, the activity of histone deacetylases, 
also found to be present in plants, may have the reverse; 
effect. Recently, genes for histone deacetylascs have 
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been isolated from Arabidopsis and shown to be impor- 
tant in plant gene expression and development^^ Other 
regulatory proteins bind to cij-aciing cIcmcnLs present 
in the promoler and act directly on the transcription 
machinery or through protein-protein interaction via 
coaciivacors. A large number of such protein? and their 
genes have been identified by gcnetic/muiant studies, 
affinity of proteins to C(5-acting elements or simply by 
homology to already known proteins^. Their functional 
activity in terms of DNA binding or transcription acti- 
vation has been analysed by in vitro fin vivo assays, in- 
cluding transgenic systems. In addition, transcription 
factors are known to have naclear localization signals, 
oligomerization domains as also other domains of still 
unspecified functions (Figure 2). 

Most of the plant regulatory factors have been classi- 
fied on the basis of conserved domains which also show 
similarity to conserved domains of other organisms. The 
basic-region leucine-zipper (bZTP) proteins contain a 
basic amino acid-rich region and a leucine-rich region 
capable of forming a zipper during oligomerization. In 
plants, genes for TGAl factor and 02 factor were iso- 
lated by screening cDNA libraries with binding oli- 
gonucleotides*'' and by mutant analysis^", respectively. 
Another group of proteins, bHLH, containing a cluster 
of basic amino acids along with a helix-loop-helix is 
represented by the R/B family of maize and also acts by 
oligomerization^'. The zinc finger motif is organized 
around a zinc ion with the help of cysteine and/or his- 
tidine residues and it may have different configurations 
identified as C2H2, CjH; C2C2, R-ING finger and LIM 
finger'^. The homcodomain contains about 60 amino 
acids organized into three or four ce-helices of which the 
terminal two helices form a helix-larn-helix (HTH) 
DNA-binding motif^'*. Analysis of CI gene from maize 
revealed presence of MYB-like proteins in plants'*. 
Subsequently, more than 150 plant MYB-like proteins 
have been isolated which contain only one or two HTH 
motifs, instead of three Such motifs found in animals. It 
is assumed that animal and plant MYB-like proteins 




Figure 2* Transcrip clonal factors have various functional domains 
(violci, DNA-binding domain: hatched box. oligomerization domain; 
laddered box. regulation domain and NLS. nuclear localization sig- 
nalb) as well he regions (piok) of undefined function (aflcr Liu 



containing three motifs existed before divergence ol the 
two groups and subsequently MYB-like proteins with 
one or two motifs evolved to predominantly regulate 
plant-specific processes^^, A unique trihclix DNA- 
binding motif containing basic, acidic tind proline/ 
glutamine-rich regions has been identified in GT box- 
binding p^oteins'^ 

Two proteins involved in flower developmeiii, 
AGAMOUS and DEFICIENS were found to have a con- 
served domain of about 57 amino acids forming a long 
of-helix and two j3-strands similar to MCMI from yeasr 
and human SRF"'^^. This domain is known as the 
MADS box and several MADS box-containing regula- 
tory proteins have been identified from plants. In addi- 
tion, transcriptional proteins from plants are known 10 
contain conserved motifs like AT-hook motif, HMG- 
box. AP2/EREBP. B3. ARF^ It may be mentioned thai 
instead of an overall amino acid sequence conscrvatitin, 
various motifs show more conservation in secondary 
structure and location of specific amino acids. 

The regulatory proteins have been shown to contain 
activation and repression domains which are effective- 
across taxonomic borders. Variability in the regulatory 
domains may allow proteins with similar DNA-binding 
domains to act in a unique manner. Yeast GAL4 and 
Herpes simplex virus VP16 activation domains have 
been shown 10 be functional in planl.s'''"''''^. Further, 
acidic domain of maize CI proLcin^^ and prolinc-rich 
region of GBFl (ref 81) also represent typical moiir^s 
known to be involved in regulatory domainal of eu- 
karyotic transcription factors. While such activation 
domains are variable in primary sequence, the GCB 
motif of HBP-la/GBF shows a high degree of cc^nscrVi'j- 
tion in consensus sequence^^. On the other hand, the rt.ilc 
of unique amino acids at selective- positions in the aeiivn- 
tion domain or its modular structure^' ha.s also been 
revealed. Evidence also exists for the presence of repres- 
sor domains since N-terminal domain of R0M2 (regula- 
tor of maturation-specific protein) has bi:c:n shown ui 
inhibit PvAF-aciivated transcription of phyiohemagglu- 
tinin gene DLEC2 from French bean^'\ Precious little is 
known about the target of regulatory proteins. Rcc^nlly, 
GT-1 has been shown 10 interact with TFIIA-TB?- 
TATA complex*'*. It is obvious that much remains ro be 
learned about the nature of regulatory domains, parlicu* 
larly in plants, and their sites of action. 

In several instances^ the quantity and availability uf 
regulatory proteins may depend on their own expression 
patterns. Such controls may be exerted al transcrip- 
tional, post-transcriptional or translational levels'. The 
translated products should be able to reach the nucleus 
in order to be effective in transcription. Translocation of 
several regulatory factors is known to be aclivcited in a 
stimuli-dependent manner**'. Plant regulatory factors arc 
also known to move from cell to cell via plasmodesmata 
to exert their effects^^ 
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In recent years, phosphorylation of regulatory pro- 
teins has emerged as a major event controlling the gene 
regulation in eukaryotes. Phosphorylation of plant regu- 
latory factors may cause stimulation of bind- 
ing/activaiion**^'^*^ or inactivation^''^ of target genes. So 
far. only casein kinase II and serine kinases have been 
shown to be involved, although others may also be ac- 
tive. Further, it has been shown that phosphorylation 
may be effected by different cues like calcium"'^^ or 
circadian-clQck^*. 

Multiplicity and redundancy of regulatory factors and 
their recognition sites raises the obvious *Jig-saw puz- 
zle' scenario and one wonders about the fundamental 
rules which establish regulation of gene expression. It is 
possible that concentration of a regulatory factor and its 
avidity to cognate DNA sequence establish a unique 
microenvironmenl for specific interactions. The combi- 
natorial control by multiple DNA-binding regulatory 
proteins or association with coactivators can also pro- 
vide specificity in action*'*^. An interesting example of 
combinatorial light-dependent regulation has been re- 
vealed by the work of Mariinez-Garcia et ai/''*. It has 
been shown that a hellx-lcop-helix transcription factor 
PIF3 binds to G-box motif of light-regulated MYB-type 
transcription factor. The phoioreceptor, phyiochrome 
binds to DNA-PIF3 complex photorcversibly. in its 
biologically active conformation. Since phytochromc is 
also known to have integral kinase activity, it is pcssi- 

Ul(^ Lliai piljr IWM* V<»»^ •* — — - 

nicate with the transcriptional complex and to activate 
red*light^dependent expression (Figure 3). It is not be- 
yond comprehension that light-activated MYB tran- 
scription factor can activate secondary genes in a light- 
dependent manner. 

Since activity of regulatory factors controls down- 
stream genes responsible to confer a particular pheno- 
type on a plant or its response to extrinsic influences, it 
would be desirable to engineer regulatory factor genes 
for crop improvement^^. Several primary and secondary 
metabolic pathways can be controlled by engineering 
key regulatory factors affecting transcription of genes 
for enzymes involved'^'^'. Already, expression of 
LEAFY under the control of CaMV 35S promoter has 
been shown to result in precocious flowering in aspen 
which takes 8-10 years to flower^^ and a new gene en- 
coding a repressor of floral transition has been isolated 
recently'^. Thus, an important trait like flowering time 
can be manipulated genetically. Manipulation of floral 
organs is possible by altering expression of horaeotic 
genes in plants. One wonders if Crocus flowers can be 
made to produce more stigmatic tissue to obtain precious 
saffron or in rice the number of grain.*? per spikelet could 
be increased by manipulating genes like ZA^M'°°. Even 
seed production has been shown to increase by over 
expression of HAT4, a homeodomain zipper prolein*^^ 
Dwarf varieties can be produced by manipulating 



Figure 3. The photorcctpior, phytochrome, gets cnnvcrtcd lu I'lV 
form on absorbing red light and moves from the cytosol to the nu- 
cleus- fn the nucleus, it interacts with G-box bound PIF3 (a himic 
helix-loop'hclix imnscription factor) on a lighi-rcgulntcd proinoK:!', 
This results in activation of genes cncodirtg MYB-type transcription 
factors {CCAl, LHY) which in turn can activate sccondiiry liglu 
rcguiaccci genes. Exposure to i:ar-rcd light converts Fir lOfm lO l-r 
form, which gets released from the transcriptional complex :intl shiiEs 
down gene expression (after Marti nez-Garcia et ai.'**). 



regulatory factors responding to the growth hormone 
gibberellin^''^ Recently, genes for regulatory factors 
have also been deployed to confer abiotic stress toler- 
ance, with a view to stimulate expression of several 
downstream genes involved in conferring such a com- 
plex phenotype*°^ '^. It, therefore, seems that genetic 
manipulation by deploying regulatory factor genes is 
becoming a reality. 



Conclusions and prospects 

In last two decades, several promoters active in cells oT 
dicot and monocot plants have been identified which are 
regulated by intrinsic and extrinsic stimuli. Functional 
genomics of Arabidopsis and rice, by way of promoter 
or enhancer traps, is likely to increase the repertoire of 
such elements to a great cxtent**'^'^^*^. Simultaneous 
work on regulatory factors has led to identification of 
major structural featur&s, including DNA binding and 
activation/repression domains. While the general theme 
of regulation across eukaryotes is represented by activa- 
tion of genes, the signals and components for plants 
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might be specific, particularly due to the uniqueness of 
signals like light or plant hormones. Plants may also use 
mechanisms like genomic imprinting to regulate activity 
of genes at a genome-wide level as has been recently 
shown by the example of delayed activation of the pa- 
ternal genome after fertilization in Arabidopsis^^ . It has 
been shown already that target-specific regulatory ele- 
ments can be deployed for engineering traits like male- 
stcrility. Genes for regulatory proteins seem to hold 
greater promise for crop improvement, particularly for 
traits involving multiple downstream genes. With the 
emerging scenario in the area of transgenics and genom- 
ics, there is need to focus on regulation biology, both 
for the sake of its novelty as well as application. 
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